Treating 'Free Word Order' in Machine Translation
نویسنده
چکیده
In free word order languages, every s('.ni;cn(:(~ is cml)eddcd in its specific contexL The order of consl;ituenLs is d('tcrmincd l)y the categories theme, t heme and contrastive J'oc'us. This paper shows [low to recognisc mid to transhti,e these cat;egorles autom;~t;ically on ;~ serH,e.ntial basis, so that SCilI;ClIC(.' ('ml)edding; can I)c' achieved wit;houL having i;o tel'e,' 1,o l,h(', co,> t;cxt. Tradition,dly neglected modit ier ch~sses are fully covered by the proposc'd mc'Lhod. 1 I n t r o d u c t i o n Most languages known as free word order hmgv, ages are in facl; languages wil;ll parl, ially J)'ee wonl order (]';ngcN(amp el, al. 1 9 9 2 ) , or ra ther free phrase order (Schi;ufelc 1991). A diflicul ty linl(cd to the form;d de.scription of i,}w.sc languages is tha t instead o17 a (:ompletc lack of ordering rules many sul)l, le rcsi;ricl, ions aljply. A large arrlounl, of wor(t or(lcr v;triai;ions ;trc gratnmi~t;ical in isoh~t<.d sct,tc, , :es, but co,,text restr icts the numl)er o1" s(K|uen(:es wllicli are possible and ilal;l~ll:al. Ill l,}lis s(.'llSC 1 Sellfences air(', era, bedded in their context . A slw.cific context calls for a certain word order, an(l the word order of a given senl, ence rcflect, s il,s conl;exl;. : ]n 1;his p;,pe,', we present ,'ccerit suggesLions on how 1;o treat, free lJhras(, order in N;d,ural L~nguagc Processing (N],P) , and [)?(?sent an alternative' solution t,o the problem. The id&~ is t o u s e a the7nat ieal l~t .#ged, or fle:,:ihie, canonical form ( C I " ) f o r generat ion, and an algorithm to recognise the. re, levant catcgorics Ihe , te , "theme ;rod cont?'asZhm ./be'us du,'iilg ;m;dy.sis. 'l'his ,ncthod has })cc'n implcmctited successfully itl the u,tiflc;d, ion and constrainl.-basc'd M~vchine Tr;mslal,i(m sysl;em CAT2 (Sharp 71989, Sl;cinlmrgcr 1992a). I1, it> (:ludcs i;he ordering of ,nodi tiers, which arc l, radil;ionally lcfI. oul; {,, wo,'d or(It,' desc,'il)l, iot, (C.o,,lo,,/l,;vet,s 71!1!)2). All stat;e, nc,~ts in (,his pN)cr concern writi,(m hmguage, ;~s spoken h.lgu;tgjc is more ]i])er;d wil, h rcsl)eCl; to ordering.
منابع مشابه
A Hybrid Machine Translation System Based on a Monotone Decoder
In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...
متن کاملTranslating into Free Word Order Languages
In this paper, I discuss machine translation of English text into a relatively “free” word order language, specifically Turkish. I present algorithms that use contextual information to determine what the topic and the focus of each sentence should be, in order to generate the contextually appropriate word orders in the target language.
متن کاملContext-free reordering, finite-state translation
We describe a class of translation model in which a set of input variants encoded as a context-free forest is translated using a finitestate translation model. The forest structure of the input is well-suited to representing word order alternatives, making it straightforward to model translation as a two step process: (1) tree-based source reordering and (2) phrase transduction. By treating the...
متن کاملDependency Tree Abstraction for Long-Distance Reordering in Statistical Machine Translation
Word reordering is a crucial technique in statistical machine translation in which syntactic information plays an important role. Synchronous context-free grammar has typically been used for this purpose with various modifications for adding flexibilities to its synchronized tree generation. We permit further flexibilities in the synchronous context-free grammar in order to translate between la...
متن کاملAssessing the Translation of Parvin Etesami's Selected Poems Using Vinay and Darbelnet’s Model
Translators always seek to find the best equivalents for each word, sentence or phrase in the target language (TL) in order to have the most accurate and meaningful translation of the text. Generally, a translator’s main concern is whether to prefer the form over the content or vice versa. In translation studies, literal translation prioritizes the form while free translation concentrates on th...
متن کاملLexical-based Reordering Model for Hierarchical Phrase-based Machine Translation
Reordering is a critical step for statistical machine translation. The hierarchical phrasebased model (HPB) based on a synchronous context-free grammar (SCFG) is prominent in solving global reorderings. However, the model is inadequate to supervise the reordering process so that sometimes phrases or words are reordered due to the wrong choice of translation rule. In order to solve this problem,...
متن کامل